Combining IR and LDA Topic Modeling for Filtering Microblogs

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Microblogs using Topic Models

As the popularity of micro-blogging increases, managing friends and followers and their tweets is becoming increasingly complex. In this project, we explore the usage of topic models in understanding both text and links in micro-blogs. On a data set of 21306 users, we find that LDA can find good topics that seem to capture meaningful topics of discussion in twitter. We also find that knowing wh...

متن کامل

Microblog Search Task at CLEF 2017: Query Generation using IR and LDA Topic Modeling Combination

The microblogs search task at CLEF 2017 consists of developing a system to search the most relevant microblogs for cultural query in a collection about festivals in all languages. Our general approach to get this objective is the following: we propose to generate from the initial tweet queries, provided for the task, extended queries able to get an answer-rich set of microblogs. This is achieve...

متن کامل

Topic Modeling using LDA with Feedback Mechanisms

Topic models provide a way to identify the latent topics from a collection of documents. Although the identified topics often appear quite representative of the data; just as often, there are parts of the output that appear erroneous or otherwise difficult to interpret by humans. This is a limitation of topic models that can be remedied by user feedback mechanisms. In this paper, I discuss two ...

متن کامل

Entities as topic labels: Improving topic interpretability and evaluability combining Entity Linking and Labeled LDA

Hurvitz, A. (2013). Late Biblical Hebrew, Khan. Khan, G. (ed.) (2013). Encyclopedia of Hebrew Language and Linguistics, Vol. 4, Leiden, Brill, 2013. Kutscher, E. Y. (1974). The Language and Linguistic Background of the Isaiah Scroll (1QIsaa), STDJ 6. Leiden, Brill. Oosting, R., Dyk, J. and Glanz, O., Valence Patterns of Motion Verbs, Semantics, Syntax and Linguistic Variation, to be published. ...

متن کامل

Probabilistic Topic and Syntax Modeling with Part-of-Speech LDA

This article presents a probabilistic generative model for text based on semantic topics and syntactic classes called Part-of-Speech LDA (POSLDA). POSLDA simultaneously uncovers short-range syntactic patterns (syntax) and long-range semantic patterns (topics) that exist in document collections. This results in word distributions that are specific to both topics (sports, education, ...) and part...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Procedia Computer Science

سال: 2017

ISSN: 1877-0509

DOI: 10.1016/j.procs.2017.08.166